Learning to Segment Instances in Videos with Spatial Propagation Network

Authors

Jingchun Cheng

Sifei Liu

Yi-Hsuan Tsai

Wei-Chih Hung

Shalini De Mello

Jinwei Gu

Jan Kautz

Shengjin Wang

Ming-Hsuan Yang

Abstract

We propose a deep learning-based framework for instance-level object segmentation. Our method consists of three main steps. First, we train a generic model based on ResNet-101 for foreground/background segmentation. Second, we fine-tune this generic model to learn instance-level models and segment individual objects, using augmented object annotations from the first frames of the test videos. To distinguish different instances in the same video, we compute a pixel-level score map for each object from these instance-level models. Each score map indicates the objectness likelihood and is computed only within the foreground mask obtained in the first step. Third, to further refine this per-frame score map, we learn a spatial propagation network, which learns how to propagate a coarse segmentation mask spatially based on the pairwise similarities in each frame. In addition, we apply a filter to the refined score map that aims to identify the best connected region using spatial and temporal consistency in the video. Finally, we decide the instance-level object segmentation in each video by comparing the score maps of the different instances.
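The final decision step described above, comparing the score maps of different instances inside the foreground mask, can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `assign_instances`, the nested-list data layout, and the simple per-pixel argmax rule are assumptions made for illustration, and the spatial propagation network and connected-region filter are omitted.

```python
def assign_instances(score_maps, foreground):
    """Label each foreground pixel with the highest-scoring instance.

    score_maps: list of 2D lists of floats, one map per instance
                (hypothetical layout, assumed for this sketch).
    foreground: 2D list of bools from a foreground/background model.
    Returns a 2D list of ints: 0 is background, k is the k-th instance.
    """
    height, width = len(foreground), len(foreground[0])
    labels = [[0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            if not foreground[y][x]:
                continue  # scores are only considered inside the foreground mask
            scores = [score_map[y][x] for score_map in score_maps]
            best = max(range(len(scores)), key=scores.__getitem__)
            labels[y][x] = best + 1  # instance ids are 1-based
    return labels


# Toy 2x2 frame with two instances (made-up scores).
score_a = [[0.9, 0.2], [0.6, 0.1]]
score_b = [[0.1, 0.8], [0.7, 0.3]]
fg_mask = [[True, True], [True, False]]
labels = assign_instances([score_a, score_b], fg_mask)
print(labels)  # [[1, 2], [2, 0]]
```

In this toy frame the bottom-right pixel stays background because it lies outside the foreground mask, regardless of its instance scores.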

Similar resources

Classification of encrypted traffic for applications based on statistical features

Traffic classification plays an important role in many aspects of network management, such as identifying the type of transferred data, detecting malware applications, applying policies to restrict network access, and so on. Basic methods in this field used obvious traffic features such as port number and protocol type to classify the traffic type. However, recent changes in applicat...

Classification of Background Subtracted Videos Using Neural Network-Learning Classifier

In this project we present the concept of effective classification of background-subtracted videos using a learning classifier, a feed-forward neural network with back propagation, to address the open problem in the context of complex scenarios, e.g., when capturing videos in applications such as cloudy or misty areas, the object in the video will be less clear to the naked eye even after the ...

A Differential Evolution and Spatial Distribution based Local Search for Training Fuzzy Wavelet Neural Network

Many parameter-tuning algorithms have been proposed for training Fuzzy Wavelet Neural Networks (FWNNs). The absence of an appropriate structure, convergence to local optima, and slow learning are deficiencies of FWNNs in previous studies. In this paper, a Memetic Algorithm (MA) is introduced to train FWNNs and address these learning deficiencies. Differential Evolution...

Temporal Segment Networks for Action Recognition in Videos

Deep convolutional networks have achieved great success for image recognition. However, for action recognition in videos, their advantage over traditional methods is not so evident. We present a general and flexible video-level framework for learning action models in videos. This method, called temporal segment network (TSN), aims to model long-range temporal structures with a new segment-based...

ANN-DEA Approach of Corporate Diversification and Efficiency in Bursa Malaysia

There is little consensus on the corporate diversification-efficiency relationship in the diversification literature. According to corporate diversification theory, firms tend to gain more market share by diversifying in the local segment or in the international market. Theoretically, a contradiction exists between the profitable strategy and the value-reducing strategy in the diversif...

Journal:

CoRR

Volume abs/1709.04609

Publication date: 2017
